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1 Introduction 

For every 9 > 0, consider a sequence Pi, P2, • • • of i.i.d. random variables with the beta(l, 9) 
distribution, i.e. P($ G du) = 0(1 - u) e ~ l du with u e]0, 1[. The Residual Allocation Model 
defined by the sequence of random variables 



has the Griffiths- Engen-McClosey distribution with parameter 0, GEM(0). Rearranging the 
terms of the sequence (£ n )ngN in the decreasing order yields a random variable ^ with values 
in the space of (proper) partitions of the unit mass 



The ranked sequence £} has a Poisson-Dirichlet distribution, which is denoted here by PD(0, 0). 



n 



6 = A , 6 = &(1 -Pi) , ■■■ , £n+l = Pn+1 X ~ Pi) , 




x = (x\, X2, ■ . .) : X\ > x-i > ■ ■ ■ > and 
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The Poisson-Dirichlet laws PD(0, 9) form a one-parameter family of probability measures on 
V\ that play an important role in a variety of areas. Let us just mention, first for 9 = 1, limit 
theorems for large combinatorial structures (e.g. decomposition of permutations into cycles, 
or of integers into prime factors), and then for general 9 > 0, Bayesian statistics, population 
genetics ... We refer to the book [TJ by Arratia, Barbour and Tavare for precise references and 
much more on this topic. The motivation for the present note stems from the fact that PD(0, 9) 
arises as invariant distribution for certain split and merge transformations; see Tsilevich [20J, 
Gnedin and Kerov [9j, Pitman [15], Diaconis et al. [6], Mayer- Wolf et al. [11], Schramm [IB] , ... 
Specifically, consider the discrete time Markov chain with values in V\ whose transitions can 
be described as follows. When the chain starts from some configuration x = (xj)j g N, pick 
two i.i.d. random integers N and N' according to the law F(N = i) = ¥(N' = i) = X{. If 
N N', then merge x^ and x^i, otherwise split the term xn = xw uniformly at random, 
and in both cases, leave the other terms of x unchanged. Finally, reorder the resulting terms 
in the decreasing order to get a partition of the unit mass. The Poisson-Dirichlet distribution 
PD(0, 1) is the unique invariant (in fact, reversible) probability measure for this split and merge 
evolution. More generally, any PD(0, 9) for 9 > appears as the invariant law of a variation 
of the preceding dynamics. In this direction, we mention that the question of the existence of 
reversible laws for a large class of coagulation-fragmentation processes with values in certain 
finite subsets of V\ (see [10J [21] for some literature on this topic) has been solved by Durrett 
et al. [7J and Erlihson and Granovsky [HJ. However, due to the requirement of finiteness, it 
does not seem that the latter results can be applied to tackle the problem of existence and 
uniqueness of the invariant distribution for the split and merge transformation above. 

Perman, Pitman and Yor [13] extended the notion of Poisson-Dirichlet distributions to a 
two-parameter family PD(a, 9) with < a < 1 and 9 > —a. One quick way for this is to apply 
the Residual Allocation Model above to the situation where Pi, fa, ■ ■ ■ are again independent 
beta variables, but now j3 n has parameter (1 — a, 9 + na), viz. 

F((3 n e du) = B(l - a, 9 + na)~ l u~ a (l - uf+^^du for < u < 1. 

This two-parameter family appears naturally in connection with excursion lengths of Bessel 
processes and their bridges; we refer to the survey by Pitman and Yor [13J for a detailed 
account on this topic. The purpose of this note is to point out that for any < a < 1 and 
9 > —a, the Poisson-Dirichlet distribution PD(a, 9) also arises as the unique reversible law for 
a certain quite natural fragmentation-coalescence process. 

The fragmentation-coalescence processes that we shall deal with here are not directly re- 
lated to the split and merge transformation, but rather belong to the family of Exchangeable 
Fragmentation-Coalescence (EFC) processes which have been introduced recently by Berestycki 
[3]. Roughly speaking, EFC processes evolve in continuous time and combine the dynamics of 
coalescents with simultaneous multiple collisions (see [T2l [19] or Chapter 4 in 0) and that of 
homogeneous fragmentations (see [I] or Chapter 3 in [5]). We stress that in general, they are 
not step Markov processes, in the sense that their first exit from an arbitrary configuration 
is instantaneous a.s. In this work, collisions in coagulation events are multiple (in fact, every 
coagulation event involves an infinite number of terms a.s.) and coagulations are simple, in the 
sense that all the terms involved in a coagulation event merge in a single term. More precisely, 
using the terminology of Pitman [H] (see also Sagitov [17]). the rates of multiple collisions 
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are governed by the measure Ag/ a (du) = (1 — u) e l a du for u g]0, 1[. The rates of dislocations 
are in turn governed by an infinite measure of Poisson-Dirichlet type denoted by PD(ct, —a), 
which has been introduced recently by Basdevant [2] in a study of Ruelle's cascades (note that 
(a, —a) is excluded from the range of parameters for Poisson-Dirichlet probability measures). 
It is interesting to observe that the coagulation rates only depend on the ratio 9/ a whereas 
those for dislocations only depend on the parameter a. 

2 Some background 

The purpose of this section is to provide a rigorous presentation of notions which have been 
discussed informally above. We thus now introduce the material which will be needed to state 
and prove our main result in the next section. Exchangeability is one of the major ingredients. 
We first recall the key connexion between random mass-partitions and random exchangeable 
partitions of N due to Kingman, and an important sampling formula for Poisson-Dirichlet 
distributions due to Pitman (we refer to e.g. Section 2.3 in [5] for a complete account). Then 
we briefly present some main features on EFC processes which are gleaned from [3]. 

2.1 Exchangeable random partitions 

A partition tt of N := {1, 2, . . .} is an infinite sequence of pairwise disjoint blocks 7Ti, 7r 2 , . . . which 
are ordered according to the increasing order of their least element, and such that [J ien TTi = N. 
The space of partitions of N is denoted by Vn. For every n G N, the restriction to [n] := 
{1, . . . ,n} induces is a natural projection tt — > 7T|[ n ] from to ?u, the space of partitions of 
[n). Plainly, the sequence of restricted partitions (7ri[ n i,n G N) is compatible, in the sense that 
for every integer n > 2, mui is a partition of [n] whose restriction to [n — 1] coincides with 
7T][n.-i]- Conversely, for any sequence of compatible partitions 7r^ G V[ n ] for n G N, there exists 
a unique partition n E Vn such that 7T|[„] = tt^. 

A random partition of N is called exchangeable when its distribution is invariant by the action 
of permutations on Vn- One can associate to every partition of the unit mass x = (xi, . . .) an 
exchangeable random partition tt by the following construction, which is often referred to as 
the paint-box process. The partition of the unit mass x is viewed as a (proper) probability 
measure on N. One introduces a sequence of i.i.d. discrete variables rj 1 ,r]2,... distributed 
according to x, i.e. P(?7„ = i) = Xi, and then defines a random partition tt of N by declaring 
that two integers k, £ belong to the same block of tt if and only if rjk = rjg. Note that tt 
contains no singletons a.s., and, by the strong law of large numbers, that every block of tt, say 
B, possesses an asymptotic frequency \B\ := lim^oo ra _1 Card(5 n [n]). Further, the sequence 
\tt\ = (|7Tj|,i G N) of the asymptotic frequencies of the blocks of tt is a size-biased reordering of 
the sequence x. Conversely, any exchangeable random partition tt which contains no singleton 
a.s. is distributed as a mixture of such paint-boxes. In particular, all the blocks of tt possess 
asymptotic frequencies a.s., and their rearrangement in the decreasing order yields a random 
(proper) partition \tt\+ = x of the unit mass whose law characterizes that of tt. 

The distribution of any exchangeable random partition tt is determined by an Exchangeable 
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Partition Probability Function (EPPF) 

p : [J N fe -> [0, 1] , p(ni,...,n fc )=P(7r| [n] =7(ni,...,n fc )), 

where n = ni + • • • + n& and 7(711, . . . ,rik) denotes an arbitrary partition of [n] which has 
fc non-empty blocks with cardinals ni, . . . (by the exchangeability property, the preceding 
probability does not depend on the choice for such a partition 7(711, . . . , n k ), and in particular 
p is a symmetric function of the variables n^). 

We next focus on Poisson-Dirichlet distributions, and fix a pair of parameters (a, 9) with 
< a < 1 and 9 > —a. Let x Q g be a random partition of the unit mass with law PD(a, 9) 
and ~K a fi a random partition of N distributed as a paint-box based on x Qj e. The EPPF p a g of 
the PD(a, 9) random partition 7r a> g is given by Pitman's sampling formula : 

. . {9l(x)kt t— r . . , . 

p ai e(7ii,...,7ijfc) = — — \\-{-oc) ni u (1) 

{0U 7=1 

where for every integer i and real number a Z_, 

(a) £T = r(a + ^)/r(a) = a(a + 1) • • • (a + £ - 1), 
and for 9 = 0, the ratio in the right-hand side of (pQ) is understood as (A; — 1)!/ (n — 1)!. 



2.2 Exchangeable Fragmentation-Coagulation processes 

This section essentially follows Berestycki [3], with some minor modifications which are made 
to adapt the presentation to our purposes. 

A Markov process II = (U(t),t > 0) with values in is called an EFC process if it is 
exchangeable (i.e. its law is invariant by the action of permutations on P^-valued processes), 
and if for every integer n, its restriction II|r n ] to V\ n ] is a continuous-time Markov chain that 
only evolves by fragmentation of one block or by coagulation. This means that when II|[ n ] starts 
from some configuration, say 7r' n ^ G V\ n ], then the only sites at which it can make its first jump 
are those that can be obtained from 7r™ by splitting exactly one of its blocks (which thus has 
at least two elements), or by merging at least two of its non-empty blocks. It is important 
to observe that an EFC process II can be recovered from the family of its restrictions nir n i, 
and thus its evolution is entirely determined by the jump rates of a family of continuous-time 
Markov chains with values in some finite space, even though the process II itself is not a step 
process in general. It is easily seen that, with the obvious exceptions of the cases when the 
fragmentation or coagulation component is degenerate, the restrictions II|[ n ] are irreducible and 
thus possess a unique invariant distribution. 

The evolution of an EFC process may be easier to understand when the latter is viewed as 
an infinite particle system, in which particles correspond to blocks of the random partitions 
n(£). Specifically, as II is an exchangeable process, for every t > 0, all the blocks of the 
partition H(t) have asymptotic frequencies a.s£] We write X(t) for the partition of the unit 

1 In fact, Berestycki (3] showed the much stronger result that with probability one, the partitions II(t) have 
asymptotic frequencies simultaneously for all t > 0. 
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mass given by the sequence ranked in the decreasing order of the asymptotic frequencies of the 
blocks of II(t). We think of the process (X(t), t > 0) as a particle system in which particles are 
characterized by their masses, and may split or coagulate with other particles. According to 
Theorem 7 of Berestycki [3J, this particle system has the Feller property, and it is easily seen 
that its distribution determines that of II. 

One of the basic results about EFC processes (cf. Propositions 4 and 5 in [3J) states that their 
dynamics are entirely characterized by a pair of measures on the space of mass-partitions, which 
specify respectively the rates of multiple coagulation and of splittings, and two nonnegative 
coefficients, one for the rate of binary coagulations and the other for the erosion. Here, we 
shall only consider the situation where there are no binary coagulation and no erosion, so the 
aforementioned coefficients are both zero. Further, we shall focus on the case when coagulation 
events are always simple, i.e. at any given time, all the particles involved into a coagulation 
event merge in a single new particle. This enables one to represent the measure characterizing 
coagulation rates as some finite measure A on ]0, 1]. The measure on V\ that characterizes 
splitting rates will be denoted by u, it fulfills the conditions 



Informally, the dynamics of the particle system X (and hence also of II) are determined by the 
measures A and v as follows. Coagulation events involving a proportion u g]0, 1] of particles 
(that are chosen uniformly at random) occur with rate u~ 2 A(du), and each particle, say y, 
splits into a sequence of smaller particles yx\,yx2, ■ ■ ■ with rate z/(dx), independently of the 
other particles in the system. Conversely, given an arbitrary finite measure A on ]0, 1] and an 
arbitrary measure v on V\ such that (T5]) holds, there exists an EFC process having only simple 
(but not binary) coagulations, with characteristics A and v. 

This description of the evolution of the particle system X may be intuitively appealing, 
however it would be difficult to make it formal because coagulation (respectively, splitting) 
events occurs immediately whenever J u~ 2 A(du) = oo (respectively, v(V\) = oo). Working 
with the Ppj-valued process II and its restrictions enables one to circumvent this difficulty. 
Indeed, the transitions of the continuous time Markov chains IL^i are defined rigorously in 
terms of A and v as follows. 

We first consider coagulation events. For all integers k, £, n with 2 < k < £ < n, pick an 
arbitrary partition 7 G V\ n \ which has i non-empty blocks, and 7' G V\ n \ another partition 
which is obtained from 7 by a simple coagulation event involving k of its blocks. Then the 
jump rate of Il|[ n ] corresponding to the coagulation event which transforms 7 into 7' is 



These are the only non-zero rates for coagulation events. 

We next turn our attention to splitting events. Pick an arbitrary partition 7 G V[ n ] which is 
not the partition into singletons, and select an arbitrary block B = (ii, . . . , i^) of 7 with k > 2 
elements. Then pick an arbitrary partition r] G Vm which is not trivial (i.e. rj 7^ ([k], 0, . . .)), 
and split B according to rj. This means that we replace the block B by smaller blocks of the type 




(2) 
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{ij, j gC} where C stands for a generic block of rj, and leave the other blocks of 7 unchanged. 
We write 7' for the resulting partition of [n]. Then the jump rate of H\[ n ] corresponding to the 
splitting event which transforms 7 into 7' is 

s n ( 7 , 7 ') = sfo) = / i/(dx)P(7rj[ fc] = 77) , (3) 

where 7r x stands for an exchangeable random partition of N constructed as a paint-box based 
on x. These are the only non-zero rates for splitting events. It is often convenient to rewrite 
(J3J) in terms of EPPF. Specifically, for every partition x of a unit mass, write p x for the EPPF 
of a paint-box based on x. Then 

s(rj) = s(k 1 ,...,k e ) = / z/(dx)p x (4) 

JVi 

where k%, . . . , hi are the cardinals of the (non-empty) blocks of rj. 



3 Some reversible EFC processes 

Here, we fix two parameters < a < 1 and 9 > —a and consider first the finite measure on 
]0, 1] given by 

A e/a (du) = (1 - u) e/a du. 

Next, following Basdevant |2], we introduce an infinite measure PD(a, — a) on V\ as follows. 
We denote as usual the generic partition of the unit mass by x = (x±,X2, . . .), and recall that 
the limit 

L = L(x) = lim nx" (5) 
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exists PD(a, 0)-a.s. Further, it is well-known that for every real number a, 

E afi (L a ) < 00 a > —a , 

where the notation E Q o indicates that the expectation is computed under the law PD(a, 0). In 
this setting, the Poisson-Dirichlet distribution PD(a,9) is absolutely continuous with respect 
to PD(a,0) : 

PD(a - 9) =E5^ PD(a - 0) ' 
Even though E a) o(l/I/) = 00, one can introduce the infinite measure 

PD(a,-a) = L _1 PD(a,0). (6) 

More precisely, Basdevant [2] constructed the measure PD(a, —a) by using a sequence of in- 
dependent beta variables and a variation of the Residual Allocation Model, and showed in 
Theorems 4.4 and 4.6 of [2] that these two constructions are equivalent. Further PD(a,— a) 
fulfills the requirement (T5]). 

Let IIq,^ denote an EFC process with characteristic measures A = Ae/ a and v = PD(a, —a), 
and let X Qj e stand for the particle system given by the process of the asymptotic frequencies 
of blocks of II Qj ranked in the decreasing order. We are now able to state the main result of 
this note. 
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Theorem 1 The Poisson-Dirichlet distribution PD(a, 6) is the unique reversible probability 
measure for the process X a) g. 

Proof: Fix an integer n > 2, and denote by p™$ the probability measure on V[ n \ which is 
given by the image of PD(a, 6) by the paint-box process restricted to [n]. Pick positive integers 
k < n and n 1; . . . , n k such that n\ + • • • + n k = n, and consider an arbitrary partition 7 G V[ n ] 
which has k non-empty blocks with sizes n\,... ,n k - According to Pitman's sampling formula 
(CQ), we have 

k 

[n] ft ~\ \ (#/ a )fct TT ( \ 

Then pick some integer i < k such that rii > 2. Consider some partition 77 of [n,] with 
£ G {2, ...,7ij} non-empty blocks, say with sizes ria, . . . , n^, and denote by 7' G P[ n j the 
partition of [n] which is obtained from 7 by splitting its i-th block according to 77. In particular 
7' has + £ — 1 non-empty blocks with respective sizes n 1; . . . , n.j_i, n$i, . . . , n^, n i+1 , . . . , 
Another application of Pitman's sampling formula (OQ) gives 

p5({7'» = (fl/ g^- T (n-t-^t) (n-(-«)»..t) (-(-«kt)-. 

On the one hand, the sampling formula of Proposition 4.3 in Basdevant [2] states that for 
v = PD(a, —a), the splitting rate s n (7, 7') = s(rj) defined by ([3]) and (jlj) is given explicitly by 

Sn(7,7') = _,_ ; ' II -(-"kt' 

On the other hand, there is a unique way of recovering 7 from 7' by a coagulation event. The 
latter involves exactly I blocks amongst the k + i — 1 blocks of 7', and thus this coagulation 
event occurs with rate 

c„(7',7) = c(fc + £-l,£) = / ^- 2 (l-u) fc -% /Q (d M ) 

7 [0,1] 

= / u l -\l-u) k - 1+9 ' a du 

J [0,1] 

r(i - i)r(Jb + e/a) 
r(fc + £- 1 + 0/a) 

(l-2)!(fl/a)^ 
(6>/a) fc+ £_i T 

Putting the pieces together, we realize that there is the identity 

P5({7}K(7,7') = pL:k{7'})c.(7',7)- (7) 

In other words, for every integer n > 2, the detailed balance equation holds for the Markov 
chain H\[ n ] and the probability measure p™ g . It should now be plain from the results which 
have been recalled in Section 2 that this proves our claim. □ 
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Our proof of Theorem [I] amounts to checking the detailed balance equation ([7j) by inspection 
and does not make the statement quite intuitive. So it may be interesting to explain quickly 
some hints for this result. First, for 6 = 0, Pitman [14] showed that the uniform measure A on 
[0, 1] governs the coagulation rates of the Bolthausen-Sznitman coalescent C BS . Further, the 
logarithmic time-change t — > — hit transforms the Bolthausen-Sznitman coalescent into a time- 
inhomogeneous fragmentation process F R (t) = C BS (— hit), t G]0, 1[, which is closely related 
to Ruelle's cascades. As Basdevant [2] identified the instantaneous dislocation measure of F R 
at time t as t~ 1 PD(t, —t), this suggests that combining fragmentation with rates governed by 
PD(t, —t) and coagulation with rates governed by A might yield a stationary Markov process 
whose invariant law should be given by the distribution of the Bolthausen-Sznitman coalescent 
at time — hit, that is PD(t, 0). For a general 6 > —a, Theorem [1] has its roots in the remakable 
duality identity between coagulation and fragmentation operators based on Poisson-Dirichlet 
variables due to Pitman (see Theorem 12 in [H] or Theorem 4.4 in [5]), combined with intuition 
gained from the preceding analysis in the case 6 = 0. 

We now complete this work by proving the convergence to equilibrium for the EFC process 

Corollary 1 For every configuration x G V\, the distribution of X a> e(t) given X Q , ) e(0) = x 
converges weakly to PD(a, 6) when t — > oo. 

Proof: We consider the process H Q j with values in started from an exchangeable random 
partition given by a paint-box process based on x. In particular, for every t > 0, the law of 
|n Q , i 6i(t)| J -, the sequence of the asymptotic frequencies of the blocks of U aj g(t) ranked in the 
decreasing order, is that of ~K a> g(t) given X Q , e (0) = x. According to Theorem 8 of Berestycki 
[3], we know that n(£) converges in distribution as t — > oo to the equilibrium measure of IT, say 
p a fi. An application of Proposition 2.9 in [5] shows that given Xq.^O) = x, X aj£ )(t) converges 
in distribution as t — > oo to the image of p a> g by the map tt — > \ir\^, which is the stationary law 
for the EFC process X„^- By Theorem [TJ we know that the latter is PD(a, 6). □ 

We stress that, more generally, this argument can be applied to general EFC processes to 
establish the convergence to equilibrium for the mass-partition valued process X. 
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